Overview

Dataset Statistics

Number of Variables 26
Number of Rows 16009
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 4.4 MB
Average Row Size in Memory 288.2 B
Variable Types
  • Categorical: 5
  • Numerical: 21

Dataset Insights

rent and return have similar distributions Similar Distribution
rent and use have similar distributions Similar Distribution
return and use have similar distributions Similar Distribution
rent is skewed Skewed
return is skewed Skewed
in_rate is skewed Skewed
out_rate is skewed Skewed
use is skewed Skewed
moment is skewed Skewed
carbon is skewed Skewed
distance is skewed Skewed
time is skewed Skewed
code_단체권 is skewed Skewed
code_일일권 is skewed Skewed
code_정기권 is skewed Skewed
sex_F is skewed Skewed
sex_M is skewed Skewed
sex_N is skewed Skewed
age_20대 is skewed Skewed
age_30대 is skewed Skewed
age_40대 is skewed Skewed
age_50대 is skewed Skewed
age_~10대 is skewed Skewed
age_기타 is skewed Skewed
date has constant length 6 Constant Length
code_일일권(비회원) has constant length 1 Constant Length
age_60대 has constant length 1 Constant Length
age_70대이상 has constant length 1 Constant Length
code_단체권 has 2926 (18.28%) zeros Zeros
  • 1
  • 2
  • 3

Variables


gu

categorical

Approximate Distinct Count 25
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 1.5 MB

Length

Mean 3.0826
Standard Deviation 0.3741
Median 3
Minimum 2
Maximum 4

Sample

1st row 송파구
2nd row 용산구
3rd row 양천구
4th row 강남구
5th row 양천구

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0

date

categorical

Approximate Distinct Count 6
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1.1 MB

Length

Mean 6
Standard Deviation 0
Median 6
Minimum 6
Maximum 6

Sample

1st row 202209
2nd row 202212
3rd row 202209
4th row 202207
5th row 202210

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 96054
  • date has words of constant length

rent

numerical

Approximate Distinct Count 4072
Approximate Unique (%) 25.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 1408.2078
Minimum 1
Maximum 20470
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • rent is skewed right (γ1 = 3.1136)

Quantile Statistics

Minimum 1
5-th Percentile 163
Q1 511
Median 1026
Q3 1874
95-th Percentile 3902.8
Maximum 20470
Range 20469
IQR 1363

Descriptive Statistics

Mean 1408.2078
Standard Deviation 1362.0927
Variance 1.8553e+06
Sum 2.2544e+07
Skewness 3.1136
Kurtosis 20.1884
Coefficient of Variation 0.9673
  • rent is not normally distributed (p-value 2.2782027588135947e-11)
  • rent has 790 outliers

return

numerical

Approximate Distinct Count 4090
Approximate Unique (%) 25.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 1402.6909
Minimum 1
Maximum 22249
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • return is skewed right (γ1 = 3.0481)

Quantile Statistics

Minimum 1
5-th Percentile 112
Q1 456
Median 1010
Q3 1880
95-th Percentile 3995.6
Maximum 22249
Range 22248
IQR 1424

Descriptive Statistics

Mean 1402.6909
Standard Deviation 1413.5858
Variance 1.9982e+06
Sum 2.2456e+07
Skewness 3.0481
Kurtosis 19.3326
Coefficient of Variation 1.0078
  • return is not normally distributed (p-value 1.356737414919634e-12)
  • return has 790 outliers

in_rate

numerical

Approximate Distinct Count 163
Approximate Unique (%) 1.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 0.9548
Minimum 0.04
Maximum 4.37
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • in_rate is skewed left (γ1 = -1.295)

Quantile Statistics

Minimum 0.04
5-th Percentile 0.56
Q1 0.94
Median 1
Q3 1.03
95-th Percentile 1.15
Maximum 4.37
Range 4.33
IQR 0.09

Descriptive Statistics

Mean 0.9548
Standard Deviation 0.182
Variance 0.03312
Sum 15285.02
Skewness -1.295
Kurtosis 12.8377
Coefficient of Variation 0.1906
  • in_rate is not normally distributed (p-value 3.0054436726577303e-18)
  • in_rate has 2740 outliers

out_rate

numerical

Approximate Distinct Count 435
Approximate Unique (%) 2.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 1.1433
Minimum 0.23
Maximum 26.1
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • out_rate is skewed right (γ1 = 11.8276)

Quantile Statistics

Minimum 0.23
5-th Percentile 0.87
Q1 0.97
Median 1
Q3 1.07
95-th Percentile 1.79
Maximum 26.1
Range 25.87
IQR 0.1

Descriptive Statistics

Mean 1.1433
Standard Deviation 0.6935
Variance 0.4809
Sum 18302.29
Skewness 11.8276
Kurtosis 241.0364
Coefficient of Variation 0.6066
  • out_rate is not normally distributed (p-value 6.179302670528675e-25)
  • out_rate has 2501 outliers

use

numerical

Approximate Distinct Count 4080
Approximate Unique (%) 25.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 1408.3593
Minimum 1
Maximum 20470
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • use is skewed right (γ1 = 3.113)

Quantile Statistics

Minimum 1
5-th Percentile 162
Q1 511
Median 1027
Q3 1875
95-th Percentile 3902.8
Maximum 20470
Range 20469
IQR 1364

Descriptive Statistics

Mean 1408.3593
Standard Deviation 1362.2881
Variance 1.8558e+06
Sum 2.2546e+07
Skewness 3.113
Kurtosis 20.1801
Coefficient of Variation 0.9673
  • use is not normally distributed (p-value 2.2804184128116612e-11)
  • use has 789 outliers

moment

numerical

Approximate Distinct Count 15663
Approximate Unique (%) 97.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 2299.0293
Minimum 47.32
Maximum 2.223e+06
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • moment is skewed right (γ1 = 80.6569)

Quantile Statistics

Minimum 47.32
5-th Percentile 366.716
Q1 863.46
Median 1530.71
Q3 2578.92
95-th Percentile 4953.464
Maximum 2.223e+06
Range 2.2229e+06
IQR 1715.46

Descriptive Statistics

Mean 2299.0293
Standard Deviation 23426.8271
Variance 5.4882e+08
Sum 3.6805e+07
Skewness 80.6569
Kurtosis 6898.752
Coefficient of Variation 10.1899
  • moment is not normally distributed (p-value 4.226517490214443e-25)
  • moment has 706 outliers

carbon

numerical

Approximate Distinct Count 4367
Approximate Unique (%) 27.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 17.2173
Minimum 0.44
Maximum 350.02
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • carbon is skewed right (γ1 = 4.7856)

Quantile Statistics

Minimum 0.44
5-th Percentile 3.15
Q1 7.45
Median 13.3
Q3 22.44
95-th Percentile 43.216
Maximum 350.02
Range 349.58
IQR 14.99

Descriptive Statistics

Mean 17.2173
Standard Deviation 15.6372
Variance 244.5209
Sum 275632.23
Skewness 4.7856
Kurtosis 56.074
Coefficient of Variation 0.9082
  • carbon is not normally distributed (p-value 5.411034183675233e-14)
  • carbon has 707 outliers

distance

numerical

Approximate Distinct Count 16002
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 74213.1063
Minimum 1911.98
Maximum 1.5087e+06
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • distance is skewed right (γ1 = 4.7856)

Quantile Statistics

Minimum 1911.98
5-th Percentile 13565.106
Q1 32108.76
Median 57341.19
Q3 96745.59
95-th Percentile 186274.674
Maximum 1.5087e+06
Range 1.5068e+06
IQR 64636.83

Descriptive Statistics

Mean 74213.1063
Standard Deviation 67401.9739
Variance 4.543e+09
Sum 1.1881e+09
Skewness 4.7856
Kurtosis 56.074
Coefficient of Variation 0.9082
  • distance is not normally distributed (p-value 5.357451728443707e-14)
  • distance has 707 outliers

time

numerical

Approximate Distinct Count 14960
Approximate Unique (%) 93.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 644.1668
Minimum 13.25
Maximum 12110.2
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • time is skewed right (γ1 = 4.2509)

Quantile Statistics

Minimum 13.25
5-th Percentile 115.172
Q1 279.2
Median 507.05
Q3 845.73
95-th Percentile 1605.576
Maximum 12110.2
Range 12096.95
IQR 566.53

Descriptive Statistics

Mean 644.1668
Standard Deviation 559.8008
Variance 313376.9849
Sum 1.0312e+07
Skewness 4.2509
Kurtosis 48.7977
Coefficient of Variation 0.869
  • time is not normally distributed (p-value 3.5154539125719527e-13)
  • time has 670 outliers

code_단체권

numerical

Approximate Distinct Count 22
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 3.3873
Minimum 0
Maximum 21
Zeros 2926
Zeros (%) 18.3%
Negatives 0
Negatives (%) 0.0%
  • code_단체권 is skewed right (γ1 = 1.3898)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 1
Median 3
Q3 5
95-th Percentile 10
Maximum 21
Range 21
IQR 4

Descriptive Statistics

Mean 3.3873
Standard Deviation 3.2769
Variance 10.7381
Sum 54228
Skewness 1.3898
Kurtosis 2.15
Coefficient of Variation 0.9674
  • code_단체권 is not normally distributed (p-value 1.1979399709093542e-09)
  • code_단체권 has 476 outliers

code_일일권

numerical

Approximate Distinct Count 25
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 16.2675
Minimum 0
Maximum 24
Zeros 14
Zeros (%) 0.1%
Negatives 0
Negatives (%) 0.0%
  • code_일일권 is skewed left (γ1 = -1.1774)

Quantile Statistics

Minimum 0
5-th Percentile 9
Q1 14
Median 17
Q3 19
95-th Percentile 21
Maximum 24
Range 24
IQR 5

Descriptive Statistics

Mean 16.2675
Standard Deviation 3.8109
Variance 14.5229
Sum 260427
Skewness -1.1774
Kurtosis 1.4631
Coefficient of Variation 0.2343
  • code_일일권 is not normally distributed (p-value 5.0036682936849037e-08)
  • code_일일권 has 408 outliers

code_일일권(비회원)

categorical

Approximate Distinct Count 8
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1.0 MB
  • The largest value (1) is over 5.84 times larger than the second largest value (2)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 1
3rd row 2
4th row 0
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 16009
  • The top 2 categories (1, 2) take over 50.0%
  • The largest value (1) is over 5.84 times larger than the second largest value (2)
  • code_일일권(비회원) has words of constant length

code_정기권

numerical

Approximate Distinct Count 24
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 20.8934
Minimum 1
Maximum 24
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • code_정기권 is skewed left (γ1 = -1.8989)

Quantile Statistics

Minimum 1
5-th Percentile 16
Q1 20
Median 21
Q3 23
95-th Percentile 24
Maximum 24
Range 23
IQR 3

Descriptive Statistics

Mean 20.8934
Standard Deviation 2.5106
Variance 6.3031
Sum 334483
Skewness -1.8989
Kurtosis 6.2723
Coefficient of Variation 0.1202
  • code_정기권 is not normally distributed (p-value 7.947317778760553e-12)
  • code_정기권 has 610 outliers

sex_F

numerical

Approximate Distinct Count 27
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 13.1157
Minimum 0
Maximum 26
Zeros 10
Zeros (%) 0.1%
Negatives 0
Negatives (%) 0.0%
  • sex_F is skewed left (γ1 = -0.4115)

Quantile Statistics

Minimum 0
5-th Percentile 7
Q1 11
Median 13
Q3 15
95-th Percentile 18
Maximum 26
Range 26
IQR 4

Descriptive Statistics

Mean 13.1157
Standard Deviation 3.3986
Variance 11.5503
Sum 209970
Skewness -0.4115
Kurtosis 0.5587
Coefficient of Variation 0.2591
  • sex_F is not normally distributed (p-value 6.980480528954399e-07)
  • sex_F has 304 outliers

sex_M

numerical

Approximate Distinct Count 25
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 14.6697
Minimum 0
Maximum 24
Zeros 1
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • sex_M is skewed left (γ1 = -0.3713)

Quantile Statistics

Minimum 0
5-th Percentile 10
Q1 13
Median 15
Q3 16
95-th Percentile 19
Maximum 24
Range 24
IQR 3

Descriptive Statistics

Mean 14.6697
Standard Deviation 2.7339
Variance 7.4743
Sum 234847
Skewness -0.3713
Kurtosis 1.2858
Coefficient of Variation 0.1864
  • sex_M is not normally distributed (p-value 9.899021982834499e-09)
  • sex_M has 605 outliers

sex_N

numerical

Approximate Distinct Count 26
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 13.8603
Minimum 0
Maximum 25
Zeros 5
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • sex_N is skewed left (γ1 = -0.4279)

Quantile Statistics

Minimum 0
5-th Percentile 8
Q1 12
Median 14
Q3 16
95-th Percentile 19
Maximum 25
Range 25
IQR 4

Descriptive Statistics

Mean 13.8603
Standard Deviation 3.1467
Variance 9.902
Sum 221890
Skewness -0.4279
Kurtosis 0.6957
Coefficient of Variation 0.227
  • sex_N is not normally distributed (p-value 1.551812362066209e-07)
  • sex_N has 219 outliers

age_20대

numerical

Approximate Distinct Count 12
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 6.7147
Minimum 0
Maximum 11
Zeros 8
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • age_20대 is skewed right (γ1 = 0.1809)

Quantile Statistics

Minimum 0
5-th Percentile 5
Q1 6
Median 6
Q3 7
95-th Percentile 9
Maximum 11
Range 11
IQR 1

Descriptive Statistics

Mean 6.7147
Standard Deviation 1.1434
Variance 1.3073
Sum 107495
Skewness 0.1809
Kurtosis 1.9932
Coefficient of Variation 0.1703
  • age_20대 is not normally distributed (p-value 6.489381237579253e-19)
  • age_20대 has 1575 outliers

age_30대

numerical

Approximate Distinct Count 12
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 6.2932
Minimum 0
Maximum 11
Zeros 9
Zeros (%) 0.1%
Negatives 0
Negatives (%) 0.0%
  • age_30대 is skewed left (γ1 = -0.423)

Quantile Statistics

Minimum 0
5-th Percentile 5
Q1 6
Median 6
Q3 7
95-th Percentile 8
Maximum 11
Range 11
IQR 1

Descriptive Statistics

Mean 6.2932
Standard Deviation 1.109
Variance 1.23
Sum 100748
Skewness -0.423
Kurtosis 3.1604
Coefficient of Variation 0.1762
  • age_30대 is not normally distributed (p-value 7.481948592152977e-21)
  • age_30대 has 1278 outliers

age_40대

numerical

Approximate Distinct Count 12
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 6.2921
Minimum 0
Maximum 11
Zeros 15
Zeros (%) 0.1%
Negatives 0
Negatives (%) 0.0%
  • age_40대 is skewed left (γ1 = -0.3336)

Quantile Statistics

Minimum 0
5-th Percentile 4
Q1 5
Median 6
Q3 7
95-th Percentile 9
Maximum 11
Range 11
IQR 2

Descriptive Statistics

Mean 6.2921
Standard Deviation 1.5516
Variance 2.4073
Sum 100730
Skewness -0.3336
Kurtosis 0.222
Coefficient of Variation 0.2466
  • age_40대 is not normally distributed (p-value 7.735393164779941e-14)
  • age_40대 has 59 outliers

age_50대

numerical

Approximate Distinct Count 10
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 5.1407
Minimum 0
Maximum 9
Zeros 25
Zeros (%) 0.2%
Negatives 0
Negatives (%) 0.0%
  • age_50대 is skewed left (γ1 = -0.3458)

Quantile Statistics

Minimum 0
5-th Percentile 3
Q1 4
Median 5
Q3 6
95-th Percentile 7
Maximum 9
Range 9
IQR 2

Descriptive Statistics

Mean 5.1407
Standard Deviation 1.3451
Variance 1.8092
Sum 82297
Skewness -0.3458
Kurtosis 0.3409
Coefficient of Variation 0.2616
  • age_50대 is not normally distributed (p-value 1.1365911381643438e-14)
  • age_50대 has 25 outliers

age_60대

categorical

Approximate Distinct Count 9
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 1.0 MB

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 0
3rd row 2
4th row 0
5th row 2

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 16009
  • The top 2 categories (3, 4) take over 50.0%
  • age_60대 has words of constant length

age_70대이상

categorical

Approximate Distinct Count 8
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1.0 MB

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 1
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 16009
  • The top 2 categories (1, 0) take over 50.0%
  • age_70대이상 has words of constant length

age_~10대

numerical

Approximate Distinct Count 11
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 5.7266
Minimum 0
Maximum 10
Zeros 175
Zeros (%) 1.1%
Negatives 0
Negatives (%) 0.0%
  • age_~10대 is skewed left (γ1 = -0.6703)

Quantile Statistics

Minimum 0
5-th Percentile 2
Q1 5
Median 6
Q3 7
95-th Percentile 8
Maximum 10
Range 10
IQR 2

Descriptive Statistics

Mean 5.7266
Standard Deviation 1.9208
Variance 3.6893
Sum 91677
Skewness -0.6703
Kurtosis 0.1984
Coefficient of Variation 0.3354
  • age_~10대 is not normally distributed (p-value 4.334025827022454e-12)
  • age_~10대 has 522 outliers

age_기타

numerical

Approximate Distinct Count 13
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 250.1 KB
Mean 6.7898
Minimum 0
Maximum 12
Zeros 39
Zeros (%) 0.2%
Negatives 0
Negatives (%) 0.0%
  • age_기타 is skewed left (γ1 = -0.9342)

Quantile Statistics

Minimum 0
5-th Percentile 3
Q1 6
Median 7
Q3 8
95-th Percentile 9
Maximum 12
Range 12
IQR 2

Descriptive Statistics

Mean 6.7898
Standard Deviation 1.6705
Variance 2.7905
Sum 108698
Skewness -0.9342
Kurtosis 1.9034
Coefficient of Variation 0.246
  • age_기타 is not normally distributed (p-value 3.787453349391683e-18)
  • age_기타 has 434 outliers

Interactions

Correlations

Missing Values